Abstract

In this paper, we establish a fundamental connection between binomial parameters and
means of bounded random variables. This connection has applications in statistical inference
for the means of bounded random variables.

1 A Fundamental Identity 

Statistical inference for the means of bounded random variables arises in many
areas of science and engineering. Since rich techniques exist for statistical inference
on the probability of an event based on binomial trial models, it is useful to express the mean of a bounded
random variable in terms of probabilities of events. In this regard, we have derived a fundamental
identity, stated in the following theorem.

Theorem 1 Let $X$ be a bounded random variable such that $0 \le X \le c$. Let $Y$ be a bounded
random variable such that $a \le Y \le b$. Let $U$ be a random variable with a uniform distribution
over $[0, 1]$. Suppose that $U$ is independent of $X$ and $Y$. Then,

$$E[X] = E[Y] + \delta \Pr\{X \ge Y + \delta U\} + |c - a - \delta| \Pr\{X \ge Y + \delta + |c - a - \delta| U\} - b \Pr\{X < bU \le Y\}$$

for any real number $\delta$.
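As a quick sanity check of Theorem 1, the identity can be evaluated exactly for a small discrete joint distribution of $(X, Y)$. The sketch below is illustrative and not part of the original derivation: it assumes $\delta > 0$, $c - a - \delta > 0$ and $b > 0$, so that each probability over $U$ reduces to a clipped ratio. The helper names `clip01` and `theorem1_rhs` and the example distribution are hypothetical.

```python
from fractions import Fraction as F

def clip01(t):
    """Pr{U <= t} for U uniform on [0, 1]."""
    return min(max(t, F(0)), F(1))

def theorem1_rhs(pmf, delta, c, a, b):
    """Right-hand side of Theorem 1 for a finite joint pmf {(x, y): prob}.

    Illustrative restriction: delta > 0, c - a - delta > 0 and b > 0.
    """
    w = abs(c - a - delta)
    e_y = p_a = p_b = p_c = F(0)
    for (x, y), p in pmf.items():
        e_y += p * y
        # Pr{x >= y + delta*U} = Pr{U <= (x - y)/delta}
        p_a += p * clip01(F(x - y) / delta)
        # Pr{x >= y + delta + w*U} = Pr{U <= (x - y - delta)/w}
        p_b += p * clip01((F(x - y) - delta) / w)
        # Pr{x < b*U <= y} = Pr{x/b < U <= y/b}
        p_c += p * max(F(0), clip01(F(y) / b) - clip01(F(x) / b))
    return e_y + delta * p_a + w * p_b - b * p_c

# Hypothetical example: 0 <= X <= 3 and 1 <= Y <= 2, with X and Y dependent.
pmf = {(0, 1): F(1, 8), (3, 2): F(3, 8), (2, 1): F(1, 4), (1, 2): F(1, 4)}
e_x = sum(p * x for (x, y), p in pmf.items())
rhs = theorem1_rhs(pmf, delta=F(1, 2), c=F(3), a=F(1), b=F(2))
```

Because the arithmetic is rational, `e_x` and `rhs` can be compared for exact equality rather than within a tolerance.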



In Theorem 1, if we set $Y = c_1$ and $\delta = c_2 - c_1$ with constants $c_1$ and $c_2$, then we have

Corollary 1 Let $X$ be a bounded random variable such that $0 \le X \le c$. Let $0 \le c_1 \le c_2 \le c$.
Let $U$ be a random variable with a uniform distribution over $[0, 1]$. Suppose that $X$ and $U$ are
independent. Then,

$$E[X] = c_1 + (c_2 - c_1) \Pr\{X \ge c_1 + (c_2 - c_1) U\} + (c - c_2) \Pr\{X \ge c_2 + (c - c_2) U\} - c_1 \Pr\{X < c_1 U\}.$$
In Corollary 1, setting $c_1 = 0$ and $c_2 = c$, we have

Corollary 2 Let $X$ be a bounded random variable such that $0 \le X \le c$. Let $U$ be a random
variable with a uniform distribution over $[0, 1]$. Suppose that $X$ and $U$ are independent. Then,

$$E[X] = c \Pr\{X \ge cU\}.$$



2 Applications 



The identity in Theorem 1 can be used to construct confidence intervals for means of bounded
random variables. Specifically, for the problem of estimating the mean of a random variable $X \in [0, c]$,
we can generate $n$ random tuples $(X_i, Y_i, U_i)$, $i = 1, \dots, n$ such that

(i) $X_i$ are i.i.d. random samples of $X$;

(ii) $Y_i$ are i.i.d. random samples of $Y$;

(iii) $U_i$ are i.i.d. random samples uniformly distributed over $[0, 1]$;

(iv) $U_i$ is independent of any tuple $(X_i, Y_i)$.

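In terms of these samples, we can define Bernoulli random variables

$$A_i = \begin{cases} 1 & \text{for } X_i \ge Y_i + \delta U_i, \\ 0 & \text{otherwise,} \end{cases}$$

$$B_i = \begin{cases} 1 & \text{for } X_i \ge Y_i + \delta + |c - a - \delta| U_i, \\ 0 & \text{otherwise,} \end{cases}$$

$$C_i = \begin{cases} 1 & \text{for } X_i < b U_i \le Y_i, \\ 0 & \text{otherwise} \end{cases}$$

for $i = 1, \dots, n$. Clearly, $A_i$, $i = 1, \dots, n$ are i.i.d. Bernoulli random variables with parameter

$$p_a = \Pr\{X \ge Y + \delta U\};$$

$B_i$, $i = 1, \dots, n$ are i.i.d. Bernoulli random variables with parameter

$$p_b = \Pr\{X \ge Y + \delta + |c - a - \delta| U\};$$

and $C_i$, $i = 1, \dots, n$ are i.i.d. Bernoulli random variables with parameter

$$p_c = \Pr\{X < bU \le Y\}.$$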
Now define

$$K_a = \sum_{i=1}^{n} A_i, \qquad K_b = \sum_{i=1}^{n} B_i, \qquad K_c = \sum_{i=1}^{n} C_i.$$

Then, $K_a$, $K_b$, $K_c$ are binomial random variables with parameters $p_a$, $p_b$ and $p_c$ respectively.
Applying the identity in Theorem 1, we have

$$E[X] = E[Y] + \delta p_a + |c - a - \delta| p_b - b p_c,$$

which is the desired link between binomial parameters and means of bounded random variables.
Assuming that $E[Y]$ is available without sampling, we can estimate $E[X]$ by estimating
$p_a$, $p_b$ and $p_c$. Clearly, the minimum variance unbiased estimates of $p_a$, $p_b$, $p_c$ are respectively

$$\frac{K_a}{n}, \qquad \frac{K_b}{n}, \qquad \frac{K_c}{n}.$$

Accordingly, an unbiased estimate of $E[X]$ is

$$E[Y] + \delta \frac{K_a}{n} + |c - a - \delta| \frac{K_b}{n} - b \frac{K_c}{n}.$$
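The counting step behind this estimate can be sketched in a few lines. The code below is a minimal illustration, assuming the samples are supplied as tuples $(x_i, y_i, u_i)$ and that $E[Y]$ is known; the function names `bernoulli_counts` and `estimate_mean` and the sample values are hypothetical.

```python
def bernoulli_counts(samples, delta, c, a, b):
    """Return (K_a, K_b, K_c) for an iterable of tuples (x_i, y_i, u_i)."""
    w = abs(c - a - delta)
    k_a = sum(1 for x, y, u in samples if x >= y + delta * u)
    k_b = sum(1 for x, y, u in samples if x >= y + delta + w * u)
    k_c = sum(1 for x, y, u in samples if x < b * u <= y)
    return k_a, k_b, k_c

def estimate_mean(samples, e_y, delta, c, a, b):
    """Unbiased estimate E[Y] + delta*K_a/n + |c-a-delta|*K_b/n - b*K_c/n."""
    n = len(samples)
    k_a, k_b, k_c = bernoulli_counts(samples, delta, c, a, b)
    return e_y + delta * k_a / n + abs(c - a - delta) * k_b / n - b * k_c / n

# Hypothetical tuples with 0 <= x <= 3, 1 <= y <= 2 and u in [0, 1].
samples = [(3, 2, 0.5), (0, 1, 0.25), (2, 1, 0.9), (1, 2, 0.6)]
counts = bernoulli_counts(samples, delta=0.5, c=3, a=1, b=2)
estimate = estimate_mean(samples, e_y=1.5, delta=0.5, c=3, a=1, b=2)
```

Only the three counts are needed once the samples have been drawn, which is what makes binomial inference techniques applicable.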

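To construct a confidence interval for $E[X]$ with confidence coefficient $1 - \alpha$, where $0 < \alpha < 1$,
we can first construct three confidence intervals employing the classical method of Clopper and
Pearson [2] such that

$$\Pr\{\underline{p}_a < p_a < \overline{p}_a\} \ge 1 - \frac{\alpha}{3}, \qquad \Pr\{\underline{p}_b < p_b < \overline{p}_b\} \ge 1 - \frac{\alpha}{3}, \qquad \Pr\{\underline{p}_c < p_c < \overline{p}_c\} \ge 1 - \frac{\alpha}{3}.$$

Then, we can apply Bonferroni's inequality to obtain a confidence interval $[\underline{\mu}, \overline{\mu}]$ such that

$$\underline{\mu} = E[Y] + \delta \underline{p}_a + |c - a - \delta| \underline{p}_b - b \overline{p}_c, \qquad \overline{\mu} = E[Y] + \delta \overline{p}_a + |c - a - \delta| \overline{p}_b - b \underline{p}_c$$

and

$$\Pr\{\underline{\mu} < E[X] < \overline{\mu}\} \ge 1 - \alpha.$$

Here we have assumed $\delta \ge 0$ and $b \ge 0$ in defining $\underline{\mu}$ and $\overline{\mu}$. The construction of the confidence interval
is similar for other signs of $\delta$ and $b$.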
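The combination step can be sketched as follows. This is a minimal illustration, assuming the three intervals for $p_a$, $p_b$, $p_c$ have already been computed, each with coverage at least $1 - \alpha/3$. The function name `combine_intervals` is hypothetical. To cover other signs of $\delta$ and $b$, the sketch picks the lower or upper endpoint of each interval according to the sign of its coefficient.

```python
def combine_intervals(e_y, delta, c, a, b, ia, ib, ic):
    """Combine (lower, upper) intervals for p_a, p_b, p_c into one for E[X].

    Each input interval is assumed to have coverage >= 1 - alpha/3, so that
    Bonferroni's inequality gives coverage >= 1 - alpha for the result.
    """
    coefficients = (delta, abs(c - a - delta), -b)
    lower = upper = e_y
    for coef, (p_lo, p_hi) in zip(coefficients, (ia, ib, ic)):
        if coef >= 0:
            lower += coef * p_lo
            upper += coef * p_hi
        else:
            # A negative coefficient swaps the roles of the endpoints.
            lower += coef * p_hi
            upper += coef * p_lo
    return lower, upper

# Hypothetical intervals, for illustration only.
mu_lo, mu_hi = combine_intervals(
    e_y=1.5, delta=0.5, c=3, a=1, b=2,
    ia=(0.4, 0.6), ib=(0.0, 0.2), ic=(0.3, 0.5),
)
```

With $\delta \ge 0$ and $b \ge 0$ this reduces to the two formulas given in the text.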

In a similar spirit, we can construct a confidence interval for $E[X]$ by using the identity in
Corollary 1 and Bonferroni's inequality. In that situation, we need to generate $n$ tuples $(X_i, U_i)$,
$i = 1, \dots, n$ such that

(i) $X_i$ are i.i.d. random samples of $X$;

(ii) $U_i$ are i.i.d. random samples uniformly distributed over $[0, 1]$;

(iii) $U_i$ is independent of any $X_i$.






When using the identity in Corollary 2 to construct a confidence interval for $E[X]$, we do not
need to apply Bonferroni's inequality. The confidence interval can be obtained by scaling the
confidence interval of the corresponding binomial parameter by $c$.
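A minimal sketch of this scaling is given below. It assumes $k$ is the number of indices with $X_i \ge c U_i$ among $n$ samples, and it computes the Clopper-Pearson limits by bisection on the binomial distribution function rather than through a statistics library. The function names are hypothetical.

```python
from math import comb

def binom_cdf(k, n, p):
    """Pr{K <= k} for K ~ Binomial(n, p)."""
    return sum(comb(n, j) * p**j * (1 - p) ** (n - j) for j in range(k + 1))

def _bisect_root(g, iters=60):
    """Root of an increasing function g on (0, 1) with g(0) < 0 < g(1)."""
    lo, hi = 0.0, 1.0
    for _ in range(iters):
        mid = (lo + hi) / 2
        if g(mid) < 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

def clopper_pearson(k, n, alpha):
    """Two-sided Clopper-Pearson interval with coverage >= 1 - alpha."""
    # Lower limit solves Pr{K >= k; p} = alpha/2 (taken as 0 when k = 0).
    lower = 0.0 if k == 0 else _bisect_root(
        lambda p: (1 - binom_cdf(k - 1, n, p)) - alpha / 2)
    # Upper limit solves Pr{K <= k; p} = alpha/2 (taken as 1 when k = n).
    upper = 1.0 if k == n else _bisect_root(
        lambda p: alpha / 2 - binom_cdf(k, n, p))
    return lower, upper

def mean_interval(k, n, c, alpha):
    """Interval for E[X] = c * Pr{X >= cU}, obtained by scaling by c."""
    lower, upper = clopper_pearson(k, n, alpha)
    return c * lower, c * upper
```

For example, `mean_interval(5, 10, 2.0, 0.05)` returns the interval for the binomial parameter from 5 successes in 10 trials, multiplied by $c = 2$.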

Applying the identity in Corollary 2, we can also solve the following sample size problem:

Suppose $X$ is a random variable bounded in $(0, c)$. How large should the sample size be to
ensure that the estimate of $E[X]$ is within error margin $\varepsilon$ of $E[X]$ with a confidence level of at
least $1 - \alpha$?

By using the link established above, we can determine the sample size and the estimate as follows.

(i) Determine the minimum sample size $n$ such that the minimum variance unbiased estimate of
any Bernoulli parameter is within error margin $\varepsilon / c$ of its true value with a confidence
level of at least $1 - \alpha$. An exact method has recently been developed by Chen [1].

(ii) Generate $n$ tuples $(X_i, U_i)$, $i = 1, \dots, n$ such that $X_i$ are i.i.d. random samples of $X$; $U_i$ are
i.i.d. random samples uniformly distributed over $[0, 1]$; and $U_i$ is independent of any $X_i$.

(iii) Return

$$\widehat{\mu} = \frac{c \sum_{i=1}^{n} A_i}{n} \quad \text{with} \quad A_i = \begin{cases} 1 & \text{for } X_i \ge c U_i, \\ 0 & \text{otherwise} \end{cases}$$

as the estimate for $\mu = E[X]$.

Then, the estimate guarantees

$$\Pr\{|\widehat{\mu} - \mu| < \varepsilon\} \ge 1 - \alpha.$$
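The three steps can be sketched as follows. Note the substitution in step (i): instead of the exact minimum sample size referred to above, the sketch uses the simpler and more conservative Hoeffding bound $n \ge \ln(2/\alpha) / (2 (\varepsilon/c)^2)$, which also guarantees the stated coverage. The function names and the example distribution of $X$ are hypothetical.

```python
import math
import random

def hoeffding_sample_size(eps, c, alpha):
    """Conservative n with Pr{|K/n - p| < eps/c} >= 1 - alpha (Hoeffding)."""
    t = eps / c
    return math.ceil(math.log(2.0 / alpha) / (2.0 * t * t))

def corollary2_estimate(pairs, c):
    """Estimate E[X] as c * (number of pairs with x >= c*u) / n."""
    n = len(pairs)
    k = sum(1 for x, u in pairs if x >= c * u)
    return c * k / n

# Hypothetical example: X = c * V**2 with V uniform on [0, 1], so E[X] = c/3.
rng = random.Random(0)
c, eps, alpha = 2.0, 0.1, 0.01
n = hoeffding_sample_size(eps, c, alpha)
pairs = [(c * rng.random() ** 2, rng.random()) for _ in range(n)]
mu_hat = corollary2_estimate(pairs, c)
```

Replacing `hoeffding_sample_size` with an exact computation would give a smaller $n$ for the same guarantee; the estimator itself is unchanged.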

Finally, we would like to point out that, by using the link between binomial parameters
and means of bounded random variables, it is possible to transform the problem of testing a hypothesis
about the mean of a bounded random variable into the problem of testing a binomial proportion.
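One way such a transformation could look, based on Corollary 2, is sketched below. The text does not specify this test, so it is an assumption for illustration. To test $H_0: E[X] \le \mu_0$ against $H_1: E[X] > \mu_0$, note that $E[X] = c p$ with $p = \Pr\{X \ge cU\}$, so $H_0$ is equivalent to $p \le \mu_0 / c$, and an exact one-sided binomial tail probability serves as a p-value. The function names are hypothetical.

```python
from math import comb

def binomial_upper_tail(k, n, p):
    """Pr{K >= k} for K ~ Binomial(n, p)."""
    return sum(comb(n, j) * p**j * (1 - p) ** (n - j) for j in range(k, n + 1))

def mean_test_p_value(k, n, c, mu0):
    """Exact one-sided p-value for H0: E[X] <= mu0 versus H1: E[X] > mu0.

    k is the number of samples with X_i >= c*U_i out of n. Under H0 the
    success probability is at most mu0/c, and the upper tail is increasing
    in p, so evaluating it at mu0/c gives a valid p-value.
    """
    return binomial_upper_tail(k, n, mu0 / c)
```

A small p-value, such as the one returned by `mean_test_p_value(10, 10, 2.0, 1.0)`, is evidence against $H_0$.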

3 Conclusion 

In this paper, an inherent connection between binomial parameters and means of bounded random
variables is established. This connection can be useful for estimation and hypothesis testing
of the means of bounded random variables.

4 Proof of Theorem 1 

Throughout the proof, we shall use Riemann-Stieltjes integration. Let $F_{X,Y}(\cdot)$ be the joint
cumulative distribution function of random variables $X$ and $Y$. Let $Z = Y + \delta$ and $v = a + \delta$.
We need some preliminary results.
Lemma 1

$$\int_{y \le x \le z} x \, dF_{X,Y} = E[Y] + \delta \Pr\{Z \ge X \ge Y + \delta U\} - \int_{x > z} y \, dF_{X,Y} - \int_{x < y} y \, dF_{X,Y}.$$

Proof. We claim that

$$\int_{y \le x \le z} x \, dF_{X,Y} = \delta \Pr\{Z \ge X \ge Y + \delta U\} + \int_{y \le x \le z} y \, dF_{X,Y}. \tag{1}$$

To show the claim, we need to consider three cases: $\delta = 0$, $\delta < 0$ and $\delta > 0$.
In the case of $\delta = 0$, we have

$$Z = Y, \qquad \{Y \le X \le Z\} = \{Y \le X \le Y\} = \{Y = X\}, \qquad \delta \Pr\{Z \ge X \ge Y + \delta U\} = 0.$$

Hence,

$$\int_{y \le x \le z} x \, dF_{X,Y} = \int_{y = x} x \, dF_{X,Y} = \int_{y = x} y \, dF_{X,Y} = \int_{y \le x \le z} y \, dF_{X,Y}$$

and the claim (1) is true for $\delta = 0$.
In the case of $\delta < 0$, we have

$$Z < Y, \qquad \{Y \le X \le Z\} = \emptyset,$$

$$0 \le \Pr\{Z \ge X \ge Y + \delta U\} \le \Pr\{Z \ge Y + \delta U\} = \Pr\{\delta \ge \delta U\} = \Pr\{U \ge 1\} = 0$$

and

$$\int_{y \le x \le z} x \, dF_{X,Y} = \int_{y \le x \le z} y \, dF_{X,Y} = 0.$$

It follows that the claim (1) is true for $\delta < 0$.

In the case of $\delta > 0$, using the assumption that $U$ is independent of $X$ and $Y$, we have

$$\begin{aligned}
\delta \Pr\{Z \ge X \ge Y + \delta U\} &= \delta \Pr\{Z \ge X \ge Y, \; Z \ge X \ge Y + \delta U\} \\
&= \delta \iint_{\substack{z \ge x \ge y + \delta u, \\ 0 \le u \le 1}} du \, dF_{X,Y} \\
&= \delta \int_{y \le x \le z} \left[ \int_{u = 0}^{\min\left(\frac{x - y}{\delta}, 1\right)} du \right] dF_{X,Y} \\
&= \delta \int_{y \le x \le z} \min\left(\frac{x - y}{\delta}, 1\right) dF_{X,Y} \\
&= \delta \int_{y \le x \le z} \frac{x - y}{\delta} \, dF_{X,Y} \\
&= \int_{y \le x \le z} x \, dF_{X,Y} - \int_{y \le x \le z} y \, dF_{X,Y}.
\end{aligned}$$

Rearranging this equation shows that the claim (1) is true for $\delta > 0$.
Finally, the proof of the lemma is completed by applying the established claim (1) and the
observation that

$$\int_{y \le x \le z} y \, dF_{X,Y} = E[Y] - \int_{x > z} y \, dF_{X,Y} - \int_{x < y} y \, dF_{X,Y}.$$

□



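Lemma 2

$$\int_{x > z} x \, dF_{X,Y} = |c - v| \Pr\{X \ge Z + |c - v| U\} + \int_{x > z} z \, dF_{X,Y}.$$

Proof. To show the lemma, we need to consider two cases: $c - v \le 0$ and $c - v > 0$.
In the case of $c - v \le 0$, we have $X \le c \le a + \delta \le Z$. Hence,

$$\{X > Z\} = \emptyset, \qquad \int_{x > z} x \, dF_{X,Y} = 0 = \int_{x > z} z \, dF_{X,Y}, \qquad |c - v| \Pr\{X \ge Z + |c - v| U\} = 0,$$

where the last equality holds because the coefficient vanishes when $c = v$ and the event
$\{X \ge Z + |c - v| U\}$ is empty when $c < v$.
It follows that the lemma is true for $c - v \le 0$.
In the case of $c - v > 0$, note that

$$\begin{aligned}
|c - v| \Pr\{X \ge Z + |c - v| U\} &= |c - v| \Pr\{X > Z, \; X \ge Z + |c - v| U\} \\
&= |c - v| \iint_{\substack{x > z, \\ x \ge z + |c - v| u, \\ 0 \le u \le 1}} du \, dF_{X,Y} \\
&= |c - v| \int_{x > z} \left[ \int_{u = 0}^{\min\left(\frac{x - z}{|c - v|}, 1\right)} du \right] dF_{X,Y} \\
&= |c - v| \int_{x > z} \min\left(\frac{x - z}{|c - v|}, 1\right) dF_{X,Y} \\
&= |c - v| \int_{x > z} \frac{x - z}{|c - v|} \, dF_{X,Y} \\
&= \int_{x > z} x \, dF_{X,Y} - \int_{x > z} z \, dF_{X,Y}.
\end{aligned}$$

Rearranging the last equation shows that the lemma is true for $c - v > 0$. This completes the
proof of the lemma. □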



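Lemma 3

$$\int_{x < y} x \, dF_{X,Y} = b \Pr\{Y > X \ge bU\}.$$

Proof. To show the lemma, we need to consider two cases: $b \le 0$ and $b > 0$.
In the case of $b \le 0$, we have $X \ge 0 \ge b \ge Y$. Hence, $\{Y > X\} = \emptyset$ and

$$0 \le \Pr\{Y > X \ge bU\} \le \Pr\{Y > X\} = 0, \qquad \int_{x < y} x \, dF_{X,Y} = 0.$$

So the lemma is true for $b \le 0$. In the case of $b > 0$, note that

$$\begin{aligned}
b \Pr\{Y > X \ge bU\} &= b \iint_{\substack{bu \le x < y, \\ 0 \le u \le 1}} du \, dF_{X,Y} \\
&= b \int_{x < y} \left[ \int_{u = 0}^{\min\left(\frac{x}{b}, 1\right)} du \right] dF_{X,Y} \\
&= b \int_{x < y} \min\left(\frac{x}{b}, 1\right) dF_{X,Y} \\
&= b \int_{x < y} \frac{x}{b} \, dF_{X,Y} = \int_{x < y} x \, dF_{X,Y}
\end{aligned}$$

for $b > 0$. This completes the proof of the lemma. □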



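Lemma 4

$$\int_{x < y} y \, dF_{X,Y} = b \Pr\{Y > X, \; Y \ge bU\}.$$

Proof. To show the lemma, we need to consider two cases: $b \le 0$ and $b > 0$.
In the case of $b \le 0$, we have $X \ge 0 \ge b \ge Y$. Hence, $\{Y > X\} = \emptyset$ and

$$0 \le \Pr\{Y > X, \; Y \ge bU\} \le \Pr\{Y > X\} = 0, \qquad \int_{x < y} y \, dF_{X,Y} = 0.$$

So the lemma is true for $b \le 0$. In the case of $b > 0$, note that

$$\begin{aligned}
b \Pr\{Y > X, \; Y \ge bU\} &= b \iint_{\substack{x < y, \; bu \le y, \\ 0 \le u \le 1}} du \, dF_{X,Y} \\
&= b \int_{x < y} \left[ \int_{u = 0}^{\min\left(\frac{y}{b}, 1\right)} du \right] dF_{X,Y} \\
&= b \int_{x < y} \min\left(\frac{y}{b}, 1\right) dF_{X,Y} \\
&= b \int_{x < y} \frac{y}{b} \, dF_{X,Y} = \int_{x < y} y \, dF_{X,Y}.
\end{aligned}$$

This completes the proof of the lemma. □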



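Now we are in a position to prove Theorem 1. Observing that

$$E[X] = \int_{y \le x \le z} x \, dF_{X,Y} + \int_{x > z} x \, dF_{X,Y} + \int_{x < y} x \, dF_{X,Y}$$

and using Lemmas 1, 2 and 3, we have

$$\begin{aligned}
E[X] = {} & E[Y] + \delta \Pr\{Z \ge X \ge Y + \delta U\} + \int_{x > z} (z - y) \, dF_{X,Y} \\
& + |c - v| \Pr\{X \ge Z + |c - v| U\} + b \Pr\{Y > X \ge bU\} - \int_{x < y} y \, dF_{X,Y}.
\end{aligned}$$

By virtue of

$$\int_{x > z} (z - y) \, dF_{X,Y} = \delta \Pr\{X > Z\}$$

and Lemma 4, we have

$$\begin{aligned}
E[X] = {} & E[Y] + \delta \Pr\{Z \ge X \ge Y + \delta U\} + \delta \Pr\{X > Z\} + |c - v| \Pr\{X \ge Z + |c - v| U\} \\
& + b \Pr\{Y > X \ge bU\} - b \Pr\{Y > X, \; Y \ge bU\}.
\end{aligned} \tag{2}$$

Noting that

$$\{Z \ge X \ge Y + \delta U\} \cup \{X > Z\} = \{X \ge Y + \delta U\}$$

and that

$$\{Z \ge X \ge Y + \delta U\} \cap \{X > Z\} = \emptyset,$$

we have

$$\delta \Pr\{Z \ge X \ge Y + \delta U\} + \delta \Pr\{X > Z\} = \delta \Pr\{X \ge Y + \delta U\}. \tag{3}$$

Noting that

$$\{Y > X \ge bU\} \cap \{X < bU \le Y\} = \emptyset$$

and that

$$\{Y > X \ge bU\} \cup \{X < bU \le Y\} = \{Y > X, \; Y \ge bU\},$$

we have

$$\Pr\{Y > X \ge bU\} + \Pr\{X < bU \le Y\} = \Pr\{Y > X, \; Y \ge bU\}.$$

Hence,

$$b \Pr\{Y > X \ge bU\} - b \Pr\{Y > X, \; Y \ge bU\} = -b \Pr\{X < bU \le Y\}. \tag{4}$$

Finally, by (2), (3) and (4), we have

$$E[X] = E[Y] + \delta \Pr\{X \ge Y + \delta U\} + |c - v| \Pr\{X \ge Z + |c - v| U\} - b \Pr\{X < bU \le Y\}.$$

The proof of Theorem 1 is thus completed.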